A Hierarchical Account of Referential Accessibility

Nancy IDE
Department of Computer Science
Vassar College
Poughkeepsie, New York 12604-0520 USA
ide@cs.vassar.edu

Dan CRISTEA
Department of Computer Science
University “Al. I. Cuza”
Iasi, Romania
dcristea@infoiasi.ro

Abstract

In this paper, we outline a theory of referential accessibility called Veins Theory (VT). We show how VT addresses the problem of "left satellites", currently a problem for stack-based models, and show that VT can be used to significantly reduce the search space for antecedents. We also show that VT provides a better model for determining domains of referential accessibility, and discuss how VT can be used to address various issues of structural ambiguity.

Introduction

In this paper, we outline a theory of referential accessibility called Veins Theory (VT). We compare VT to stack-based models based on Grosz and Sidner's (1986) focus spaces, and show how VT addresses the problem of "left satellites", i.e., subordinate discourse segments that appear prior to their nuclei (dominating segments) in the linear text. Left-satellites pose a problem for stack-based models, which remove subordinate segments from the stack before pushing a nuclear or dominating segment, thus rendering them inaccessible. The percentage of such cases is typically small, which may account for the fact that their treatment has been largely overlooked in the literature, but the phenomenon nonetheless persists in most texts. We also show how VT can be used to address various issues of structural ambiguity.

1 Veins Theory

Veins Theory (VT) extends and formalizes the relation between discourse structure and reference proposed by Fox (1987). VT identifies “veins” over discourse structure trees that are built according to the requirements put forth in Rhetorical Structure Theory (RST) (Mann and Thompson, 1987). RST structures are represented as binary trees, with no loss of information. Veins are computed based on the RST-specific distinction between nuclei and satellites; therefore, RST relations labeling nodes in the tree are ignored. Terminal nodes in the tree represent discourse units and non-terminal nodes represent discourse relations.

The fundamental intuition underlying VT is that the distinction between nuclei and satellites constrains the range of referents to which anaphors can be resolved; in other words, the nucleus-satellite distinction induces a domain of referential accessibility (DRA) for each referential expression. More precisely, for each anaphor a in a discourse unit u, VT hypothesizes that a can be resolved by examining referential expressions that were used in a subset of the discourse units that precede u; this subset is called the DRA of u. For any elementary unit u in a text, the corresponding DRA is computed automatically from the text's RST tree in two steps:

1. Heads for each node are computed bottom-up over the rhetorical representation tree. Heads of elementary discourse units are the units themselves. Heads of internal nodes, i.e., discourse spans, are computed by taking the union of the heads of the immediate child nodes that are nuclei. For example, for the text in Figure 1, with the rhetorical structure shown in Figure 2, the head of span [...] is unit 5. Note that the head of span [...] is the list [...] because both immediate children are nuclei.

2. Using the results of step 1, Vein expressions are computed top-down for each node in the tree, using the following functions: [...]

[...] find their referees within that sub-text. Referees that do not appear in the DRA are possible, but are more difficult to process, both computationally and cognitively (see Section 2.2). This conjecture expresses the intuition that potential referees of the REs of a unit depend on the nuclearity of previous units: both a satellite and a nucleus can access a previous nuclear node, a nucleus can only access another left nuclear node or its own left satellite, and the interposition of a nucleus [...]

[...] 1, to find a co-reference, but a co-referent is found using the stack-based method by searching back m non-left satellite units, where m [...]

[...] all. This accounts for the marked divergence in effort shown in Figure 6 as k increases. Although our corpus is necessarily small, these results [...]

[...] difficulties are overcome, this situation leads to the postulation of cataphoric references when a satellite precedes its nucleus, which is counter-intuitive.

3 VT and Structural Ambiguity

The fact that VT considers only the nuclear-satellite distinction and ignores rhetorical labeling has practical ramifications for anaphora resolution systems that rely on discourse structure to determine the DRA for a given RE. Marcu et al. (1999) show that over a corpus of texts drawn from MUC newspaper texts, the Wall Street Journal corpus, and the Brown Corpus, reliable agreement among annotators is consistently obtained for discourse segmentation and assignment of nuclear-satellite status, while agreement on rhetorical labeling was less reliable (statistically significant for only the MUC texts). This means that even when there exist differences in rhetorical labeling, vein expressions can be computed and used to determine DRAs.

VT also has ramifications for evaluating the viability of different structural representations for a given text, at least for the purposes of reference resolution. Like syntactic parsing, discourse parsing typically yields several interpretations, and one of the a priori tasks for further analysis of the parsed texts is to choose one from among potentially several alternative structures. Marcu (1996) showed that using only rhetorical relations, as many as five different structures can be identified for some texts. Considering intention-based relations can yield even more alternatives. For anaphora resolution, the choice of one structure over another may have significant impact. For example, an RST tree for (6) using rhetorical relations is given in Figure 9; Figure 10 shows another RST tree for [...], whereas in the rhetorical representation, the vein is [...]. That is, under the constraints imposed by VT, John is not available as a referent for he in C6 in the rhetorical version, although John is clearly the appropriate antecedent. Interestingly, the intention-based analysis is skewed to the right and thus is a "better" representation according to the criteria outlined in (Marcu, 1996); it also eliminates the left-satellite that was shown to pose problems for stack-based approaches. It is therefore likely that in this case, the intention-based analysis is "better" for the purposes of anaphora resolution.

(6) A6 Tell John to bring the car home by 5.
    B6 That way I can get to the store before it closes.
    C6 Then he can finish the bookshelves tonight.

[Tree diagram; node labels: condition, A6-B6, C6, A6, B6]
Figure 8: RST tree for text (6), using rhetorical relations

[Tree diagram; node labels: motivation, A6, B6-C6, B6, C6]
Figure 9: RST tree for text (6), using intention-based relations

Conclusion

Veins Theory is based on established notions of discourse structure: hierarchical organization, as in the stack-based model and RST's tree structures, and dominance or nuclear/satellite relations between discourse segments. As such, VT captures and formalizes intuitions about discourse structure that run through the current literature. VT also explicitly recognizes the special status of the left satellite for discourse structure, which has not been adequately addressed in previous work.

In this paper we have shown how VT addresses the left satellite problem, and how VT can be used to address various issues of structural ambiguity. VT predicts that references not resolved in the DRA of the unit in which they appear are more difficult to process, both computationally and cognitively; by looking at cases where VT fails we determine that this claim is justified. By comparing the types of referring expressions for which VT and the stack-based model fail, we also show that VT provides a better model for determining DRAs.

Acknowledgements

We thank Daniel Marcu for providing us with the RST annotated MUC corpus, and Valentin Tablan for developing part of the software that enabled us to process the data.

References

Azzam, S., K. Humphreys and R. Gaizauskas (1998) Evaluating a Focus-Based Approach to Anaphora Resolution. Proceedings of COLING-ACL’98, 74-78.

Cristea, D., Ide, N., and Romary, L. (1998) Veins Theory: A Model of Global Discourse Cohesion and Coherence. In Proceedings of the 36th Annual Meeting of the ACL and of the 17th COLING (COLING/ACL’98), 281-285.

Cristea, D., Ide, N., Marcu, D., and Tablan, V. (2000) An Empirical Investigation of the Relation Between Discourse Structure and Co-Reference. In Proceedings of Coling 2000, 208-214.

Fox, B. (1987) Discourse Structure and Anaphora. Written and Conversational English. No. 48 in Cambridge Studies in Linguistics, Cambridge University Press.

Grosz, B. and Sidner, C. (1986) Attention, Intention and the Structure of Discourse. Computational Linguistics, 12, 175-204.

Gundel, J., N. Hedberg, and R. Zacharski (1993) Cognitive Status and the Form of Referring Expressions. Language, 69:274-307.

Hahn, U. and M. Strübe (1997) Centering in-the-large: Computing referential discourse segments. Proceedings of ACL-EACL’97, 104-111.

Hirschman, L. and N. Chinchor (1997) MUC-7 Co-reference Task Definition.

Mann, W. C., Thompson, S. A. (1988) Rhetorical structure theory: A theory of text organization. Text, 8:3, 243-281.

Marcu, D., E. Amorrortu, and M. Romera (1999) Experiments in Constructing a Corpus of Discourse Trees. Proceedings of the ACL’99 Workshop on Standards and Tools for Discourse Tagging.

Marcu, D. (2000) Extending a Formal and Computational Model of Rhetorical Structure Theory with Intentional Structures à la Grosz and Sidner. Proceedings of the 18th International Conference on Computational Linguistics, 523-29.

Marcu, D. (1999) A Formal and Computational Synthesis of Grosz and Sidner's and Mann and Thompson's theories. Workshop on Levels of Representation in Discourse, 101-108.

Marcu, D. (1996) Building Up Rhetorical Structure Trees. The Proceedings of the Thirteenth National Conference on Artificial Intelligence, vol. 2, 1069-1074.

Mitkov, R. (1997) Factors in anaphora resolution: they are not the only things that matter. A case study based on two different approaches. Proceedings of the ACL/EACL’97 Workshop on Operational Factors in Practical, Robust Anaphora Resolution, 14-21.

Moser, M. and J. Moore (1996) Towards a Synthesis of Two Accounts of Discourse Structure. Computational Linguistics, 18(4): 537-544.

Sidner, C. (1981) Focusing on the Interpretation of Pronouns. Computational Linguistics, 7:217-231.
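
As an illustration of step 1 of the DRA computation described in Section 1 (heads computed bottom-up over a binary RST tree), the following minimal sketch may be helpful. It is not the authors' implementation: the `Node` class, its field names, and the three-unit toy tree are assumptions made only for this example, and the tree is not the one in Figure 2. The sketch covers head computation only, not vein expressions.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    """A node of a binary RST tree (hypothetical representation)."""
    nuclear: bool                   # True if the node is a nucleus of its parent
    unit: Optional[int] = None      # unit number, set only for terminal nodes
    left: Optional["Node"] = None   # left child, set only for spans
    right: Optional["Node"] = None  # right child, set only for spans


def head(node: Node) -> List[int]:
    """Compute the head of a node bottom-up (step 1)."""
    if node.unit is not None:
        # The head of an elementary discourse unit is the unit itself.
        return [node.unit]
    # The head of a span is the union of the heads of its immediate
    # children that are nuclei, kept in left-to-right order.
    result: List[int] = []
    for child in (node.left, node.right):
        if child is not None and child.nuclear:
            result.extend(head(child))
    return result


# Toy tree (an assumption, not from the paper): unit 2 is a satellite of
# unit 1, and the span over units 1-2 is multinuclear with unit 3.
span_12 = Node(nuclear=True,
               left=Node(nuclear=True, unit=1),
               right=Node(nuclear=False, unit=2))
# The root has no parent; marking it nuclear is a convention of this sketch.
root = Node(nuclear=True, left=span_12, right=Node(nuclear=True, unit=3))

print(head(span_12))  # [1]
print(head(root))     # [1, 3]
```

In the toy tree the root's head is a list of two units because both of its immediate children are nuclei, which mirrors the multinuclear case noted in step 1.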